On Evaluating Decentralized Parallel I/O Scheduling Strategies for Parallel File Systems
نویسندگان
چکیده
This paper evaluates the impact of the parallel I/O scheduling strategy on the performance of the file access in a parallel file system for clusters of commodity computers (Clusterfile). We argue that the parallel I/O scheduling strategy should be seen as a complement to other file access optimizations like striping over several I/O servers, non-contiguous I/O and collective I/O. Our study is based on three simple decentralized parallel I/O heuristics implemented inside Clusterfile. The measurements in a real environment show that the performance of parallel file access may vary with as much as 86% for writing and 804% for reading with the employed heuristic and with the schedule block granularity.
منابع مشابه
Distributed Scheduling Algorithms to Improve the Performance of Parallel Data Transfers 1 By
The cost of data transfers and in particular of I O operations is a growing problem in parallel computing This performance bottleneck is especially severe for data intensive appli cations such as multimedia information systems databases and Grand Challenge problems A promising approach to alleviating this bottleneck is to schedule parallel I O operations explicitly Although centralized algorith...
متن کاملvPFS: Bandwidth Virtualization of Parallel Storage Systems
This paper presents vPFS, a new parallel file system performance management approach to support the allocation of shared storage bandwidth on a per-application basis. Existing parallel file systems are unable to differentiate I/O requests from different applications and meet per-application bandwidth requirements. This limitation presents an increasing hurdle for applications to achieve their d...
متن کاملGreen Energy-aware task scheduling using the DVFS technique in Cloud Computing
Nowdays, energy consumption as a critical issue in distributed computing systems with high performance has become so green computing tries to energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high end computing can bring various benefits such as red...
متن کاملSimulation of Parallel Programs on Application and System Level
Computer simulation revolutionizes traditional experimentation providing a virtual laboratory. The goal of high-performance computing is a fast execution of applications since this enables rapid experimentation. Performance of parallel applications can be improved by increasing either capability of hardware or execution efficiency. In order to increase utilization of hardware resources, a rich ...
متن کاملImproving Parallel I/O Performance through Scheduling Algorithms based on Randomized, Distributed Edge Coloring Algorithms∗
A growing imbalance in CPU and I/O speeds has led to a communications bottleneck in distributed architectures, especially for data-intensive applications such as multimedia information systems, databases, and Grand Challenge problems. Our solution is to schedule parallel I/O operations explicitly. We present a class of decentralized scheduling algorithms that eliminate contention for I/O ports ...
متن کامل